f  l.J 

AD-A074  839 

UNCLASSIFIED 

I  OF  | 


*20748*8 


AEROSPACE  CORP  EL  SEGuNDO  CA  ENGINEERING  GROUP  F/6  9/4 

AN  IMAGE  TRANSFORM  CODING  ALGORITHM  BASED  ON  A  GENERALIZED  CORR— ETC(U) 
AUG  79  S  M  MELZER  F04701-78-C-0079 

TR-0079(4901-03)-6  SAMSO-TR-79-58  NL 


END 

DATE 

FILMED 

11-79 


DOC 


An  Image  Transform  Coding  Algorithm 
Based  on  a  Generalized  Correlation  Model 


,  S.M.MELZER  , 
Engineering  Group  - 
The  Aerospace  Corporation 
El  Segundo,  Calif.  90245 


24  August  1979 


Interim  Report 


APPROVED  FOR  PUBLIC  RELEASE 
DISTRIBUTION  UNLIMITED 


Prepared  for 

SPACE  AND  MISSILE  SYSTEMS  ORGANIZATION 
AIR  FORCE  SYSTEMS  COMMAND 
Los  Angeles  Air  Force  Station 
P.O.  Box  92960,  Worldway  Postal  Center 
Los  Angeles,  Calif.  90009 


This  interim  report  wee  submitted  by  The  Aerospace  Corporation, 

El  Segundo,  CA  90245,  under  Contract  F04701 -78-C-0079  with  the  Space 
and  Missile  Systems  Organization  (AFSC),  Los  Angeles  Air  Force  Station, 
P.O.  Box  92960,  Worldway  Postal  Center,  Los  Angeles,  CA  90009.  It 
was  reviewed  and  approved  for  The  Aerospace  Corporation  by  D.  J.  Griep, 
Engineering  Group.  Lieutenant  J.  C.  Garcia,  DYXT,  was  the 
Deputy  for  Technology  project  engineer. 

This  report  has  been  reviewed  by  the  Information  Office  (OI)  and  is 
releasable  to  the  National  Technical  Information  Service  (NTIS).  At  NTIS, 
it  will  be  available  to  the  general  public,  including  foreign  nations. 

This  technical  report  has  been  reviewed  and  is  approved  for  publication. 
Publication  of  this  report  does  not  constitute  Air  Force  approval  of  the 
report's  findings  or  conclusions.  It  is  published  only  for  the  exchange  and 
stimulation  of  ideas. 


_ UNCLASSIFIED _ 

SECURITY  CLASSIFICATION  or  THIS  RAPE  fWh«n  Dtp  Bnlorod) 

f/^EPORT  DOCUMENTATION  PAGE 

I  - _  |2.  GOVT  ACCESSipN  No! 

*  SAMS^jTR-79-58  ) _ Q?JC 

«.  TITLE  (t>d  Subtitle)  - - — _ _ 

>  ).AN  JM AGE  .TRANSFORM  jCODING  ALGORITHM 
<  BASED  ON  "A  GENERA  UZED  CORRELATION 

MODEL. _ _ _ — - ""  f, 


)  Stuart  M .jM el zer ^  V'A 

9  PERFORMING  ORGANIZATION  NAME  ANO  AOORESS 

The  Aerospace  Corporation 
El  Segundo,  Calif.  90245 

II.  CONTROLLING  OFFICE  NAME  AND  AOORESS 

■  Space  and  Missile  Systems  Organization/ DYXTVj 
P.O.  Box  92960,  Worldway  Postal  Center  -. 

Los  Angeles.  Calif.  90009 _ Q, 

T7  MONITORING  AGENCY  NAME  »  ADDRESS (It  different  from  Controlling  Office) 


I  IS.  DISTRIBUTION  STATEMENT  (ol  i hie  Report) 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 

1.  RECIPIENTS  CATALOG  NUMBER 

/■  X  ■*  r  s  d  ■ 


Interim 


BER 

V)  TR  -,0079(4901 -03)- 6  J 

~)  Fj»47^1  -  78 -C -01079? 

10.  PROGRAM  ELEMENT.  PROJECT. 
AREA  A  WORK  UNIT  NUMBERS 

TASK 

ll  4  Aug«*  B?79  ) 
rr^MRiBwi  ns fi — * - 

W)  _ _ _ 

rr  SECURITY  CLASS,  (of  thl •  roport) 

Unclassified 


15#.  DECLASSIFICATION/ DOWNGRADING 

schedule 


Approved  for  public  release;  distribution  unlimited. 


\T7.  DISTRIBUTION  STATEMENT  (of  th»  mbrtract  ontorod  in  Block  20,  It  dlfforont  from  Roport) 


18.  SUPPLEMENTARY  NOTES 


Ilf.  KEY  WOROS  ( Continue  on  rovorto  §ldo  If  nocooomry  md  Identity  by  block  numbor) 


Image 

Transform 

Coding 


Statistics 

Bits 


fcj  ABSTRACT  (Contlnuo  on  rororoo  oldo  II  noeooomry  and  IdonUty  by  block  numbor) 

Image  transform  coding  is  a  technique  whereby  a  sampled  image  is  divided 
into  blocks.  A  two-dimensional  discrete  transform  of  each  block  is  taken, 
and  the  resulting  transform  coefficients  are  coded.  Coding  of  the  transform 
coefficients  requires  their  quantization  and,  consequently,  a  model  for  the 
transform  coefficient  variances  that  is  based  in  turn  on  a  correlation  model 
for  the  image  blocks.  In  the  proposed  correlation  model  each  block  of  image 
data  is  formed  by  an  arbitrary  left  and  right  matrix  multiplication  of  a  sta¬ 
tionary  white  matrix.  One  consequence  of  this  correlation  model  is  that  the  — 


DO  ru""  1471 

IF  A  CSlMl  Li  I 


i/i  £  2  7? 


_ UNCLASSIFIED  f  /*  t 

1 E CURlTV  CLASSIFICATION  OF  TMtS  PAGE  (WhenDte  Entered) 


UNCLASSIFIED 


MCUWTV  CLASSIFICATION  OF  THIS  FAOBfOMa  Ma  (Man« 


SICUftITV  CLASSIFICATION  OF  TNIS  FAM<*M»  DM*  BnNfMO 


PREFACE 

The  author  would  like  to  thank  Drs.  R.  V.  Cox 
and  A.  G.  Tescher  for  the  use  of  their  input- 
output  subroutines  and  Mr.  C.  H.  Garcia  for 
his  programming  support. 

This  report  was  prepared  as  a  paper  for 
publication  in  the  Proceedings  of  the  Society 
of  Photo -Optical  Instrumentation  Engineers, 
Vol.  149*  Application  of  Digital  Image 
Processing,  August  1978. 


1 


CONTENTS 


INTRODUCTION . 

STATISTICAL  MODEL  THEORY  . 

The  Transform  Statistics . 

Rate  Distortion  Results  . 

DESCRIPTION  OF  THE  ALGORITHM . 

Coding  of  the  Transform  Statistics . 

Bit  Allocation  Rules  for  the  Transform  Coefficients 

PERFORMANCE  EVALUATION . 

Performance  Equations  . 

Test  Results . 

CONCLUSIONS . 

REFERENCES  . 


FIGURES 


Basic  Structure  of  Statistical  Transform  Coding 
Local  Bit  Control  for  Weather  Image  1 
Global  Distortion  Control  for  Weather  Image  2 
Global  Distortion  Control  for  Site  Image 

TABLES 

Ustog^MoCo?"1?  !°T. COdi"8  o£  StaU>tlc»1  Coefficient, 

Image  Specifications  . 

Performance  Results . 


11 

11 

12 

12 


13 

13 


f 


4 


AN  IMAGE  TRANSFORM  CODING  ALGORITHM  BASED  ON  A  GENERALIZED  CORRELATION  MODEL* 


Int  roduction 

l  ransform  coding  i a  an  efficient  technique  for  compressing  coded  image  arrays  from  a  high  number  to  a 
low  number  of  bits  per  pixel  because  the  transform  concentrates  the  image  energy  in  the  lower  index  trans¬ 
form  coefficients.  The  developed  algorithm  discussed  in  this  paper  is  within  the  framework  of  statistical, 
adaptive  transform  coding  in  that  the  transform  coefficients  are  quantized  with  a  quantizer  and  bit  allocation 
based  on  the  transform  coefficient  variances.  The  term  adaptive  transform  coding  is  applicable  because 
the  transform  coefficient  variances  for  each  block  are  estimated  from  the  transform  coefficients  of  that 
block.  Recomputation  of  the  transform  coefficient  variance  estimates  for  each  block  allows  these  variance 
estimates  to  adapt  to  the  statistical  variations  between  blocks.  This  latter  property  is  also  found  in  the  re¬ 
cursive  estimation  of  the  transform  coefficient  variances  as  proposed  by  Tescher  and  Cox  Adaptive 
image  coding  techniques  have  been  recently  surveyed  by  Habibi.fc)  The  algorithm  developed  in  this  paper 
is  new  in  the  choice  of  model  for  the  transform  coefficient  variances. 

Statistical  Model  Theory 

An  image  array  is  divided  into  N\N  matrices  called  blocks.  For  the  purpose  of  generating  second-order 
statistics,  the  fundamental  assumption  made  in  this  paper  is  that  each  block  is  formed  by  an  outer  product 
matrix  multiplication  on  a  zero  mean  stationary  white  matrix: 

X  HWGT  ;  Trace  (HTH)  =  Trace  (GTG)  =  N  (I) 

In  Eq.  (I).  X  is  the  NxN  matrix  of  image  data  for  one  block.  H  and  G  are  NxN  scaled  matrices,  and  W  is 
an  N  *  N  stationary  white  matrix. 

E[w  (k.iljso4*  ;  E|w(k,  i )w(r,  s )  1  0  for  k,i  #  r,s  {l) 

In  Eq.  (*!).  w(k,  i)  is  the  k.i  element  of  W  and  e[*J  is  the  statistical  expectation  operator. 

The  model  proposed  here  is  an  extension  of  the  model  typically  assumed  for  generating  the  second-order 
statistics  of  random  vectors,  i.e.,  that  the  random  vector  is  formed  by  a  matrix  multiplication  on  a  sta¬ 
tionary  white  vector.  The  best  known  special  case  that  fits  the  model  [Eq.  (l)j  is  that  X  is  a  block  from  a 
two-dimensional  exponentially  correlated  image. 

It  follows  from  Eq.  (2)  that,  for  an  arbitrary  N^N  matrix  Q, 

ElwTQW)  =  E|WQWTJ  =  (Trace  Q)o2l  (31 

where  I  is  the  NxN  identity  matrix.  It  follows  from  Eqs.  (11  and  (3)  that 

o"  Trace  (e1\XT/N2|)  (4) 

ElxxT/Nc“|  imT  :  e(xTX/Nc<21  =  GGT  ($) 


This  work  reflects  research  conducted  under  IL  S.  Air  Force  Space  and  Missile  Systems  Organisation 
(SAMSO)  Contract  No.  F04 701  - 77-C -0078. 


The  matrix  ElxX^/No“]  reflects  the  row  correlations  in  X,  and  the  matrix  ElX  ' X /No" J  reflects  the  column 
correlations  in  X. 

The  Transform  Statistics 

The  transform  of  the  matrix  X  is  the  NxN  matrix  Z: 

Z  =  UTXV  ;  U'1  =  UT  .  V'^V1"  (6) 

In  Eq.  (6),  U  and  V  are  arbitrary  unitary  matrices  and  it  follows  that,  given  Z,  then  X  =  UZV^.  Subse¬ 
quently,  we  investigate  the  second-order  statistics  of  the  transform  coefficients  z(n,  /).  From  (6) 

z(n,  f  )  =  UTXV .  (7) 

— n  —  l 

where  Un  is  the  n^1  column  of  U  and  is  the  column  of  V.  From  Eqs.  (1),  (3),  and  (7),  it  follows  that 

E[z(n,  Oz(r.  s  )J  =  o2(u^HHTUr)(  V  JgGT\^)  (8) 

Substitution  of  Eq.  (5)  into  Eq.  (8)  gives 


E[z(n,  f)z(r,  s)]  =  o2(ujElXXT/No2lUr)(vjE(xTX/No2lv/)  (9) 

What  is  learned  from  Eqs,  (4)  and  (9)  is  that  the  two  matrices  E[XX^]  and  E[X^x]  constitute  sufficient  sta¬ 
tistics  for  the  determination  of  the  second-order  statistics  of  the  z(n,  l)  transform  coefficients.  Moreover, 
if  UTe[XXt]U  is  the  unitary  transformation  that  diagonalizes  EtxX^j  and  if  V^e[X^X]V  is  the  unitary  trans¬ 
formation  that  diagonalizes  E{X^Xj,  then  the  transform  coefficients  are  uncorrelated  with  E[z(n, <)z(r,  s)]  =  0 
for  (n,f)  *  (r,  s).  In  this  case,  Z  =  U^XV  is  the  Karhunen -Loeve  or  Hotelling  transformation  for  X. 

From  Eq.  (9)  it  follows  that  the  variance  of  the  (n, /)  transform  coefficient  is 

a2  ,  £  E[z2(n,  #)J  =  aZ\  B,  (10) 

n,  f  n  / 

X  £  U^E[xXT/No2lU  ;  0,  £  vTe[xTX/No2]V,  (11) 

n  — n  —  n  I  — t  —  l 

What  is  learned  from  Eq.  (10)  is  that  the  transform  coefficient  variances  are  product  separable  in  row  and 
column  indexes.  Since  XX~  and  x"^X  are  positive  semidefinite  matrices  and  since  U  and  V  are  unitary,  it 
follows  that 


N  N 

X  2  0  >  £  X„  =  N  ;  9.  a  0  ,  £s  =N  (12) 

n  n  =  1  n  1  f=  1  1 

From  Eqs.  (10)  and  (12),  a2,  X^.  and  can  be  expressed  directly  in  terms  of  the  E[z2(n,  I)]: 

<?2  =  £  £  E[z2(n,/)]/N2  (13) 

f=ln=l 

N  -  ,  N  ,  , 

X  =  ^EU^n./jJ/Na  ;  9,  =  £  Efz^n,  f)]/Na  (14) 

n  f=l  1  n=  1 

The  results  of  applying  the  rate  distortion  theory  to  random  vectors ^  carry  through  analogously  to  ran¬ 
dom  matrices  modeled  by  Eq.  (1).  If  the  elements  of  W  are  Gaussian,  then  the  least  number  of  bits  per  ele¬ 
ment  B  required  to  block  code  X  with  mean  square  distortion  D  per  element  is 

B  =  0,5  log2((j2/D>  -  (0.  S/Nlflog^DetRj1)  +  log^DetR^1)]  (15) 

Rj  £  E[xXT/No2]  and  R2  4  E[xTX/No2]  (16) 

In  Eq.  (15),  D/o^  is  assumed  to  be  less  than  the  product  of  the  smallest  eigenvalue  of  Rj  times  the  smallest 
eigenvalue  of  R£  with  Rj  and  R^  restricted  to  be  positive  definite.  The  proof  is  discussed  below  with  b(k,  i), 
b(k,  i),  z(n, /)  and  z(n, /)  denoting  the  elements  of  X,  X,  Z  and  Z  respectively  and  X  =  UZVT. 
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Rate  Distortion  Results 

The  block  mean  square  distortion  is  defined  as  D-j-  2.2. E[(b(k,  i )  -  b(k,i)|‘"|.  Since  U  and  V  are  unitary, 
it  follows  that  Dj  2.2JEl(z(n,  I)  -  2(n,  /))-).  Each  transform  coefficient  z(n,  I)  is  coded  with  Un#  i  bits  and  de - 
coded  as  z(n,  f)  with  mean  square  distortion  Dn>  /  El(z(n,  f)  -  z(n.  Thus,  Of  1.1.1 )n>/.  We  next  as¬ 

sume  that  the  elements  of  W  in  the  model  given’ in  (1)  are  Gaussian,  and  it  follows  that  the  z(n, /)  are 
Gaussian.  The  minimum  bit  allocation  Bn<  /  to  achieve  the  mean  square  distortion  Dn,  f  *  i  for  Gaussian 
z(n,  /)  is  Bn>  i  0.  S  log^jj.  ft Dn,  /).  The  total  bit  allocation  for  the  block  is  Bj  lilBn,  t  and  it  follows 
that 


B 


T 


O.SEElog^ 


2 

n,  / 


0.  52.'2Uog  ,D 
*2  n 


t 


(17) 


To  minimize  Bt  for  a  given  Dt.  we  choose  Dn,  /  to  maximize  2.2Tog>Dn>  /.  The  solution  is  Dn,  t  Dt/N1- 
provided  D^/N-  "  -ft.  /  for  all  indexes.  Considering  the  case  where  t)iis  last  inequality  is  satisfied,  then 


iT  0.  Slilllog^o^  f  -  0.  £>N“log^(DT/N‘ 


(18) 


Denoting  D  Dx/M“  as  the  normalized  distortion,  B  Bt/N-  as  tlie  normalized  bit  allocation,  and  sub - 
itituting  (  o^Vn8f,  it  follows  that 


B 


N 

Elo*z8, 

it 


(19) 


T 

From  (11)  and  (lb)  the  \n  are  the  main  diagonal  terms  of  U  RjU  and  the  8/  are  the  main  diagonal  terms  of 
V^R^V.  The  term  B  is  minimized  when  II\n  and  118/,  which  are  the  respective  products  of  these  main 
diagonals,  are  minimized.  This  occurs  when  Z  U^XV  is  the  Karhunen- Loeve  Transform  for  X.  Then  the 
\n  are  the  eigenvalues  of  Rj  and  the  8/  are  the  eigenvalues  of  R>.  It  follows  that  Xlog^Xn  log^(M\n)  - 
log  »(DetR |)  and  similarly  for  R^.  Substitution  into  (19)  gives 

B  0.5  log2(ca/D)  +  (O.S/Nltlog^DetRj)  +  log  ,(DetR  ,)]  (20) 

Equation  (lb)  follows  since  DetRj*  =  (DetRi)'l  and  similarly  for  R?.  Also,  the  restriction  on  D  is 
D/q^  5  Xn8/  for  all  n,  t. 


Description  of  the  Algorithm 

The  basic  structure  of  statistical  transform  coding  algorithms  is  given  in  Fig.  1.  The  original  image  is 
a  large  dimensional  array  whose  elements  are  each  coded  using  Bc  bits,  and  the  images  in  our  study  have 
elements  that  take  on  integer  values  from  0  to  2^o  _  The  first  step  is  to  divide  the  image  into  NxN  blocks 
with  the  choice  in  this  study  of  N  ib.  The  16  xi6  block  size  is  large  enough  to  take  advantage  of  the  com¬ 
pression  of  energy  into  the  lower  index  transform  coefficients  and  small  enough  to  permit  consideration  of 
statistical  variations  over  the  image.  Next,  the  two -dimensional  discrete  cosine  transform  operation  U  -  V, 
Z  UTxU  is  performed.  The  elements  u(n,  0  of  the  NXN  unitary  matrix  U  are 

u(n,  1)  =  1A/N  ;  n=i,  2  •  •  -N  (21) 


u(n,  t) 


Jhcoa[ 


(f  -  l)ir(2n 
2N 


n  1.  2 
/  =2,  3 


N 

N 


(22) 


The  cosine  transform  is  a  popular  choice  because  of  the  closeness  to  the  Karhunen  -  Loeve  Transform  for 
exponentially  correlated  images.)'*'  In  this  study  no  fast  Fourier  Transform -type  algorithm  is  used.  The 
cosine  transform  Z  U^XU  and  the  inverse  X  -  UZU^  are  computed  as  written  -  by  matrix  multiplication. 


Unlike  the  assumption  made  in  the  theoretical  discussion,  the  real  image  blocks  X  do  not  have  zero  mean. 
For  the  two-dimensional  cosine  transform,  only  the  z(l,  1)  transform  coefficient  is  biased.  Using  (21) 


N  N 

2(1.  1)  =  E  E  *>(k.i)/N  (23) 

i=l k=l 

Bc 

where  b(ly  i)  is  an  element  of  X.  Since  each  b(k,  i)  is  an  integer  on  0  to  2  -  1,  then  Nz(l.  1)  is  an  integer 

on  0  to  N^(2“o  -  1)  that  requires  less  than  2  log^N  +  B0  bits  to  code  without  error.  In  this  study,  z(l,  1)  is 
decoded  exactly  (i.e.,  2(1,  1)  =  z(l,  I))  with  the  required  number  of  bits  included  in  the  bit  count. 

In  the  next  step,  the  \_,  and  8/  transform  statistics  are  estimated  for  each  block  using  that  block's 
transform  coefficients  to  replace  Elz^fn.  /)]  in  (13)  and  (14): 


7 


( ■  V 


N  N  , 

£  V  s~(n,  l)/N" 

I  I  n 1 


V' 


n 


rX  2  ,  > 

£  *  (n.  M/N(c 

I  1 


£  • 

£  i“(n.  n/Nt  -  V 

n  1 


(/‘H 


J  »  > 

In  Eqz.  (24)  Anti  (25),  (  '*)  .  \*  .  Ami  S'  denote  tin*  ostinutos  of  ?  ,  \  ,  and  ?  .  Also,  the*  actual  z**(I,  1)  is 
replaced  by  zero.  n  f  n  1 

The  estimates  of  the  block  transform  statistics  must  he  coded  since  they  are  require*!  for  obtaining  the 
decoded  transform  coefficients .  Let  5W,  \,^.  and  •/  denote  the  quantized  ('')*’.  \*n,  and  S*(  respectively.  Fhen 
the  transform  coefficient  quantized  standard  deviation  is 


c» 

n. 


( 2o ) 


Each  z(n.  I)  with  the  exception  of  z(l.  1)  is  normalized  by 


v’n,  (  1°  obtain  the  normalized  transform  coefficient 


X(n,  n  z(n.0/3n  t  (27) 

The  normalized  transform  coefficient  then  quantized  using  a  ll  ^  I*it  unity  variance  Max  -  Lloyd^  ^  Gaussian 
quantizer.  The  choice  of  fttl>  f  is  based  upon  *n  f  as  discussed  later. 

Decoding  the  bit  word  for  the  quantized  normalized  transform  coefficient  yields  X(n.  f).  which  is  multiplied 
by  *x\t  f  to  obtain  the  decoded  transform  coefficient 

z(n.  /)  5^  ^X(n.  f)  (28) 


The  some  3n>  t  mu*t  be  used  in  (27)  and  (28),  which  is  why  ("')•-,  \'  and  must  be  quantized  and  included 
in  the  bit  stream.  Finally,  the  matrix  of  decoded  transform  coefficients  ?.  [z(n.  f)J  is  inverted  to  obtain 

the  decoded  image  block  $  [6(k,  i)|: 

X  UZUT  (.191 

Coding  of  the  Transform  Statistics 

Since  *p,t  i*  used  to  normalize  z(n,  f),  accurate  quantizing  of  the  transform  statistics  means  that  (5/z'), 
(^n  /^n'*/‘‘»  atlt*  (5//8|)^  -  are  near  1.  It  follows  that  the  transform  statistics  should  be  logarithmically 
quantized.  For  (?)-,  the  output  choices  used  are  1,  C.  C**  •  •  •  CK-I  where  K  is  the  number  of  choices. 

The  number  of  bits  required  to  code  (V)*-  is  logj»K  and  is  c vaunted  as  overhead.  For  (j,/)“  between  the 
smallest  and  largest  output  choices. 


( 5  ^  cINTllog2(c»,r  /log  >c  ] 

where  INTI  I  is  used  here  as  the  closest  integer  to  the  argument. 

The  primary  contribution  to  overhead  is  the  coding  of  the  2N  statistical  coefficients.  \'«  •  •  •  \'N  and 
3  1  •  •  •  It  we  allocated  H  bits  to  each  coefficient,  then  the  overhead  is  2NB/N*"  2R/N  bits  per  pixel. 

The  coding  of  \'n  for  N  In  is  as  follows.  From  (24)  and  (25)  >  0  and  lb,  and  it  follows  that 

^  5  *  ®t>*  The  eight  output  level  choices  of  l/ln.  1/8,  1/4.  1/2,  1,  2,  4,  and  8  are  chosen  for  \n  j 

the  logarithmic  quantization  rule  is 


(30) 


pixel 

and 


•i>oS,vi 


1  /  in  <  \  <8 


I /lt>  for  V 


1/lt'  and  V  8  for  V  >  8 
n  n 


(Jn 


The  Mmr  approach  is  used  t»>  Cixlr  the 


If  the  same  number  of  hita  is  allocated  to  each  of  the  8  choices,  then  each  coefficient  is  allocated  '  hits 
and  the  resulting  overhead  for  coding  all  3/  statistical  coefficients  is  (Jl(3)/lt>  0.378  hits  per  pixel.  How¬ 

ever,  since  the  transform  tends  to  concentrate  block  energy  in  the  lower  index  transform  coefficients,  the 
\n  obtained  from  (311  will  tend  to  decrease  with  increasing  index  n.  Thus,  for  each  index  n  of  \n  there  are 
some  choices  more  likely  to  be  selected  than  others.  For  example.  \j  is  nearly  always  4  or  8.  The  impli¬ 
cation  is  that  \\e  can  improve  on  the  overhead  of  0.378  hits  per  pixel  for  the  3.1  statistical  coefficients  by 
using  a  variable  word  length  code. 


t 
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A  Fano  code  was  tried  that  uses  code  words  of  1  to  7  bits  for  the  eight  choices.  The  code  words  in  the 
order  of  decreasing  probability  are  0,  1-0,  1  - 1  -0,  1-1-10,  11-1-1-0,  1-1-1-11-0,  1  - 1  - 1  - 1  -  1  - 1  - 0,  and 
1  -  l-  l-  l-  l-  l-  l.  The  most  probable  \  is  coded  0,  the  second  most  probable  is  coded  1-0,  and  so  on. 
Because  the  sequence  tends  to  be  correlated,  the  order  of  probability  for  \n  was  set  from  \n_| 

for  n  2,  3  ...  lo  with  the  order  fixed  for  \j.  The  rule  used  for  setting  the  order  of  probability  is  given  in 
Table  1.  The  lowest  overhead  obtainable  with  this  scheme  is  0.  12^  bits  per  pixel  for  all  32  coefficients. 
There  are  other  methods  for  quantizing  the  that  might  be  tried,  including  differential  pulse  code  modula¬ 
tion  on  the  sequence  log  Aj  •  •  •  log^V|(j,  one -dimensional  transform  coding  of  this  sequence,  and  curve 
fitting  to  a  finite  parameter  model  of  this  sequence. 


Bit  Allocation  Rules  for  the  Transform  Coefficients 


The  number  of  bits  I3n  /  allocated  to  code  X(n,  /)  is  determined  from  the  bit  allocation  rule.  A  noninteger 
possible  negative  Bn  /  is  computed  and  then  set  to  the  closest  positive  integer  or  zero: 


B  ,  =  HMT[B  J  (or  B  >0  and 
n,  f  n,  t  n,  I 


ft  =  0  for  B  ,  I  0 
n,  f  n,  r 


(3-J) 


The  rules  apply  to  all  X(n,  I)  with  the  exception  of  n,  I  1,1.  As  discussed  earlier,  z(  1.  1 )  is  error-free 
coded  using  ft(t  j  =  l  log^N  *  Bu  8  +  Bo  bits.  For  the  X(n,  I),  if  Bn<  /  0,  then  X(n,  f)  =  0  is  decoded. 

The  first  allocation  rule  is  called  here  "local  bit  control"  (LBC)  and  is  used  to  allocate  approximately  the 
same  number  of  bits  to  each  block.  The  rule  is 


n,  l 


bt/n‘ 


,  r N  N  >  i 

(0.5/N  )  £  £  log  ,5; 

Lr  in  l  “  n'  J 


0.  5  log  S2  , 

n.  I 


(33) 


Note  that  /  B-p  but,  since  Bn>  i  0  for  Bn>  /  <  0,  then  EEBn  (  will  be  somewhat  greater  than  Bj. 

This  rule  tends  to  keep  the  allocations  for  the  blocks  about  the  sa  ?,  but  results  in  block  distortions  that 
differ  from  block  to  block  reflecting  their  statistical  variations. 


The  second  allocation  rule  is  called  here  "global  distortion  control"  (GDC)  and  is  used  to  maintain  approx¬ 
imately  the  same  block  distortions  for  each  block.  This  rule  is 


Bn,(;0'51oM3il/Dl  <34 

Note  that  E EB„  [  0.5EElog  >3^,  i  -  O.SN-log^D.  The  bit  allocation  will  vary  between  blocks,  reflecting  the 
differences  in  block  quantized  transform  statistics. 

Performance  Evaluation 


Performance  Equations 

The  performance  of  the  algorithm  is  evaluated  using  a  normalized  mean  square  error  (MSE)  criterion. 
The  MSE  for  the  entire  image  is  defined  by 


MSE 


v  DST 
-  ENGY 


(35) 


where  DST  is  the  distortion  per  pixel  and  ENGY  is  the  energy  per  pixel.  The  distortion  per  pixel  is  defined 
by 


.  M  N  N  - 

DST  £  -J—  £  £  £  (b  (k.i)  -  b  (k.i))  (3o) 

N“M  rdi  lk  1  r  r 

where  r  is  the  block  index  out  of  M  total  blocks,  br(k,i)  is  the  input  image,  and  br(k,  i)  is  the  output  image. 
Since  the  transform  is  unitary,  DST  can  be  expressed  in  terms  of  the  transform  coefficients  by 

.  M  N  N  , 

dst  -  — y—  £  £  £  ujn.  o  -  in,  m  (37) 

N  M  r=U  1  n  =  l  r 


The  energy  per  pixel  is  defined  by 

M  N  N  , 

ENGY  £  £  £  (b  (k.i)  -  m)  (38) 

N  M  r=  1  i=  1  k=  1  r 


where  m  is  the  image  global  sample  mean. 
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.  M  N  N 

nO-j—  £  £  £b(k,i)  (39) 

N  M  r  1  i  1  k  1 

Note  from  (35)  and  (38)  that  the  MSE  is  computed  here  with  the  global  sample  mean  extracted. 

I  he  bits  per  pixe.  that  is  used  to  achieve  the  MSE  is  13  a  3-  Bg  where  B^  is  the  bits  per  pixel  used  to  code 
the  transform  coefficients  and  Bg  is  the  bits  per  pixel  useu  to  code  the  transform  statistics.  The  relation  for 

BA  is 

M  N  N 

BA  -7-  £  E  L'  B„  ,(«■)  (40) 

N  M  r  1  f  1  n  1  ’ 

In  (40),  B  r )  is  the  l>it  alloeatitm^for  tin*  n,  f  transform  coefficient  of  block  r.  As  discussed  earlier, 

Bt.  1  ( r )  K  f  B„  bits  and  all  other  fsn>f(r)  follow  from  and  (33)  or  (34).  The  relation  for  13s  is 

(  M  r  , 

B<-  — s —  32  block  r  bits  used  to  code  (o')  ,  X'  •  •  •  X'  and  P'  •  ■  •  p'  (41) 

N“M  r  ll  1  N  1  NJ 


Test  Results 

I  he  algorithm  is  tested  on  two  satellite  weather  images  and  a  site  image.  Image  specifications  are  given 
in  Table  2,  and  performance  -esults  for  MSE  compared  with  the  bits  per  pixel  B ^  +  Bg  are  presented  in 
Table  3.  Selected  original,  reconstructed,  and  difference  images  are  shown  in  Figs.  2,  3,  and  4. 

The  contribution  to  I3g  as  the  result  of  coding  the  (o')4-  is  log>K/256  bits  per  pixel,  which  is  0.019  for  the 
weather  images  (i.e.,  K  32,  C  1.25)  and  0.023  for  the  site  image  (i.c.,  K  64,  C  1.22).  The  con¬ 
tribution  to  Bg  is  0.  375  bits  per  pixel  as  the  result  of  coding  the  \7s  and  B's  without  a  variable  word  length 
code.  Simulated  reductions  were  obtained  using  the  Fano  code  of  Table  1.  For  weather  imago  1,  the  con¬ 
tribution  to  Bg  was  reduced  to  0.  285;  for  weather  image  2,  to  0.  307;  and  for  the  site  image,  to  0.277. 

These  results  are  reflected  in  Table  3. 


For  a  given  bits  per  pixel,  global  distortion  control  provides  a  lower  mean  square  error  than  local  bit 
control  as  may  be  seen  by  comparing  the  first  and  second  cases  for  weather  image  2  given  in  Table  3.  A 
characteristic  of  local  bit  control  is  that  the  difference  image  will  have  "blocking"  reflecting  the  different 
distortions  on  blocks  with  different  statistics.  This  effect  can  be  seen  in  the  difference  image  shown  in 
Fig.  2  for  weather  image  1. 

In  Figs.  2,  3,  and  4  the  reconstructed  images  are,  to  the  eye,  replications  of  the  originals.  Thus,  to 
accentuate  differences  between  the  original  and  reconstructed  images,  an  amplified  difference  image  is 
shown  representing  30|b(k,i)  -  f>(k,  i)|  with  white  cor  responding  to  higher  difference  than  black.  Note  that 
the  difference  images  in  Figs.  3  and  4  have  the  pepper  appearance  that  indicates  the  fairly  uniform  distortion 
over  the  image,  which  is  characteristic  of  global  distortion  control. 


Conclusions 

A  transform  coding  algorithm  has  been  developed  based  on  a  generalized  correlation  model  of  the  image 
blocks.  The  algorithm  requires  coding  of  the  transform  coefficients  plus  the  overhead,  which  is  the  bits 
per  pixel  required  to  code  the  transform  statistics.  The  developed  algorithm  is  characterized  by  a  low  mean 
square  error  between  original  and  reconstructed  images  for  the  bits  per  pixel  allocated  to  code  the  transform 
coefficients  without  excessive  overhead.  Among  various  possible  schemes  for  coding  the  transform  statis¬ 
tics,  an  eight-level  quantizer  followed  by  a  variable  word  length  code  was  utilized.  For  all  tests  the  re¬ 
constructed  images  are  very  good,  supporting  the  reasonableness  of  the  product  separable  model  for  the 
transform  coefficient  variances.  The  model  for  the  transform  coefficient  variances  is  sufficiently  general¬ 
ized  so  that  high  frequency  image  structure  can  he  reflected.  Computation  of  the  quantized  transform  co¬ 
efficient  variances  prior  to  coding  the  transform  coefficients  permits  control  of  the  bit  rate  since  the  number 
of  bits  allocated  to  each  block  is  known  prior  to  coding. 
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Fig.  1.  Basic  structure  of  statistical  transform  coding. 
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Fig.  2.  Local  bit  control  for  weather  image  1. 
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Table  1.  Bit  Allocation  Rule  for  Coding  of  Statistical  Coefficients  Using  Kano  Code 
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Table  2.  Image  Specifications 


B 

o 

Bits  /  Pixel 

Image 

Dimensions 

No.  of 
Blocks(M) 

Global  Sample 
Mean(m ) 

1 

energy  per  1 
Pixel  (ENGY)  1 

Weather  Image  1 

6 

480  V  704 
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30.  7b 

191. 2b 

Weather  Image  2 

6 

480  X  704 
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as.  88 

23. 70 

Site  Image 

10 

5 1  a  v  SI  1 

— 
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306.00 

72b9. 

_ 1 

Table  3.  Performance  Results 


ns 

nA  f  BS 

MSE  x  100 
(%) 

Bit  Allocation 
Rule 

Weather  Image  ! 

0.  187 

0.  304 

0.491 

1.47 

GDC 

=  6 

0.  827 

0.  304 

1.131 

0.  so 

GDC 

1 .  S7S 

0.  304 

1.879 

0.  2  3 

LBC 

Weather  Image  2 

o.  asb 

0.  32b 

0.  582 

l.bS 

GDC 

B  b 

0.  306 

0.  32b 

0.632 

1.82 

LBC 
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Site  Image 
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0.  St  s 

4.  S3 

GDC 

B  10 
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a.  loa 
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